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Abstract 



Inferring probabilistic networks from data is a notoriously difficult task. Under various goodness-of-fit mea- 
sures, finding an optimal network is NP-hard, even if restricted to polytrees of bounded in-degree. Polynomial- 
time algorithms are known only for rare special cases, perhaps most notably for branchings, that is, polytrees 
in which the in-degree of every node is at most one. Here, we study the complexity of finding an optimal 
polytree that can be turned into a branching by deleting some number of arcs or nodes, treated as a parame- 
ter. We show that the problem can be solved via a matroid intersection formulation in polynomial time if the 
number of deleted arcs is bounded by a constant. The order of the polynomial time bound depends on this 
(N. constant, hence the algorithm does not establish fixed-parameter tractability when parameterized by the number 

of deleted arcs. We show that a restricted version of the problem allows fixed-parameter tractability and hence 
scales well with the parameter. We contrast this positive result by showing that if we parameterize by the number 
of deleted nodes, a somewhat more powerful parameter, the problem is not fixed-parameter tractable, subject to 
00 ' a complexity-theoretic assumption. 

o: 

1 Introduction 

There has been extensive research on learning probabilistic networks from data by maximizing some suitable 
scoring function. Edmonds (1967) gave an efficient algorithm for the class of branchings, that is, directed forests 
with in-degree at most one; the algorithm was discovered independently by Chu and Liu (1965), and it has been 
later simplified and expedited by others Bock (1971); Camerini, Fratta, and Maffioli (1979); Fulkerson (1974); 
Gabow et al. (1986); Gabow, Galil, and Spencer (1989); Karp (1971); Tarjan (1977). Chickering (1996) showed 
that for general directed acyclic graphs, DAGs, the problem is NP-hard even if the in-degree is at most two. 
Motivated by this gap, Dasgupta (1999) asked for a network class that is more general than branchings yet 
admitting provably good structure-learning algorithms; his findings concerning polytrees, that is, DAGs without 
undirected cycles, were however rather negative, showing that the optimization problem is NP-hard even if the 
in-degree is at most two. 

Given the recent advances in exact exponential algorithms in general (see, e.g., the book by Fomin and 
Kratsch (2010)), and in finding optimal DAGs in particular, it is natural to ask, whether "fast" exponential-time 
algorithms exist for finding optimal polytrees. For general DAGs the fastest known algorithms run in time within 
a polynomial factor of 2™, where n is the number of nodes Koivisto and Sood (2004); Ott and Miyano (2003); 
Parviainen and Koivisto (2009); Silander and Myllymaki (2006). However, it is not clear, whether even these 
bounds can be achieved for polytrees; a brute-force algorithm would visit each polytree one by one, whose num- 
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ber scales as the number of directed labelled trees n™~ 2 2™ _1 Cayley (1889). Do significantly faster algorithms 
exist? Does the problem become easier if only a small number of nodes are allowed an in-degree larger than one? 

In this work, we take a first step towards answering these questions by considering polytrees that differ from 
branchings by only a few arcs. More precisely, we study the problem of finding an optimal k-branching, defined 
as a polytree that can be turned into a branching by deleting fc arcs. We make the standard assumption that the 
scoring function decomposes into a sum of local scores; see the next section for precise definitions. We note 
that fc-branchings generalize branchings in a different direction than the Tree-augmented Naive Bayes classifier 
(TAN) due to Friedman, Geiger, and Goldszmidt (1997). Namely, in a TAN the in-degree of each node is at most 
two, and there is a designated class node of in-degree zero, removing of which leaves a spanning tree; the tree is 
undirected in the sense that the symmetric conditional mutual information is employed to score arcs. 

Polynomial-time result for fc-branchings Our main result is an algorithm that finds an optimal fc-branching 
in polynomial time for every constant k. (See the next section for a formal definition of the problem.) Our over- 
all approach is straightforward: we search exhaustively over all possible sets of at most k "extra arcs", fix the 
guessed arcs, and solve the induced optimization problem for branchings. Implementing this seemingly innocent 
algorithm, however, requires successful treatment of certain complications that arise when applying the existing 
matroid machinery for finding optimal branchings. In particular, one needs to control the interaction of the extra 
arcs with the solution from the induced subproblem. 

Fixed-parameter tractability Our algorithm for the fc-branching is polynomial for fixed k, but the degree of 
the polynomial depends on k, hence the algorithm does not scale well in k. We therefore investigate variants 
of the fc-branching problem that admit fixed-parameter tractability in the sense of Downey and Fellows (1999): 
the running time bound is given by a polynomial whose degree is independent of the parameter, the parameter 
contributing a constant factor to the bound. 

In particular, we show that the fc-branching problem is fixed-parameter tractable if the set of arcs incident to 
nodes with more than one parent form a connected polytree with exactly one sink, and each node has a bounded 
number of potential parent sets. This result is interesting as we show that the fc-branching problem remains NP- 
hard under these restrictions. 

We complement the fixed-parameter tractability result by showing that more general variants of the fc-branching 
problem are not fixed-parameter tractable, subject to complexity theoretic assumptions. In particular, we show 
that the fc-branching problem is not fixed-parameter tractable when parameterized by the number of nodes whose 
deletion produces a branching. 

2 The fc-branching problem 

A probabilistic network is a multivariate probability distribution that obeys a structural representation in terms 
of a directed graph and a corresponding collection of univariate conditional probability distributions. For our 
purposes, it is crucial to treat the directed graph explicitly, whereas the conditional probabilities will enter our 
formalism only implicitly. Such a graph is formalized as a pair (TV, A), where N is the node set and A C N x N 
is the arc set; we identify the graph with the arc set A when there is no ambiguity about the node set. A node u is 
said to be a parent of v in the graph if the arc (u, v) is in A; we denote by A v the set of parents of v. When our 
interest is in the undirected structure of the graph, we may denote by A the skeleton of A, that is, the set of edges 
{ {u, v} : (u, v) G A }. For instance, we call A a polytree if A is acyclic, and a branching if additionally each 
node has at most one parent. 

When learning a probabilistic network from data it is customary to introduce a scoring function that assigns 
each graph A a real-valued score f(A) that measures how well A fits the data. While there are plenty of alterna- 
tive scoring functions, derived under different statistical paradigms and assumptions Lam and Bacchus (1994); 
Chickering (1995); Heckerman, Geiger, and Chickering (1995); Dasgupta (1999), the most popular ones share 
one important property: they are decomposable, that is, 

f{A) = Y,fv{A v ), 
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Figure 1 : An optimal polytree for a given scoring function. 



with some "local" scoring functions /„. The generic computational problem is to maximize the scoring function 
over some appropriate class of graphs given the local scoring functions as input. Note that the score f v (A v ) need 
not be a sum of any individual arc weights, and that the parent set A v may be empty. Figure 1 shows a table 
representing a local scoring function /, together with an optimal polytree. 

We study this problem by restricting ourselves to a graph class that is a subclass of polytrees but a superclass 
of branchings. We call a polytree A a k-branching if there exists a set of at most k arcs D C A such that in A \ D 
every node has at most one parent. Note that any branching is a 0-branching. The k-branching problem is to find 
a fc-branching A that maximizes f(A), given the values f v (A v ) for each node v and some collection of possible 
parent sets A v C TV \ {v}. 

3 An algorithm for the k -branching problem 

Throughout this section we consider a fixed instance of the /c -branching problem, that is, a node set N and scoring 
functions /„ for each v € N. Thus all arcs will refer to elements of N x N. We will use the following additional 
notation. If A is an arc set, then H(A) denotes the heads of the arcs in A, that is, the set { v : (u, v) e A }. If C 
is a set of edges, then N(C) denotes the induced node set { u, v : {u, v} e C }. 

We present an algorithm that finds an optimal k -branching by implementing the following approach. First, 
we guess an arc set D of size at most k. Then we search for an optimal polytree A that contains D such that in 
A \ D every node has at most one parent; in other words, B = A \ D is an optimal branching with respect to an 
induced scoring function. Clearly, the set D must be acyclic. The challenge is in devising an algorithm that finds 
an optimal branching B that is disjoint from D while guaranteeing that the arcs in D will not create undirected 
cycles in the union BUD. To this end, we will employ an appropriate weighted matroid intersection formulation 
that extends the standard formulation for branchings. 

We will need some basic facts about matroids. A matroid is a pair (E, X), where E is a set of elements, called 
the ground set, and X is a collection of subsets of E, called the independent sets, such that 

(Ml) G X; 

(M2) if A C B and B E X then A E X; and 

(M3) if A, B e X and \ A\ < \B\ then there exists an e e B \ A such that A U {e} G X. 

The rank of a matroid is the cardinality of its maximal independent sets. Any subset of E that is not indepen- 
dent is called dependent. Any minimal dependent set is called a circuit. 

The power of matroid formulations is much due to the availability of efficient algorithms Brezovec, Cornuejols, 
and Glover (1986); Edmonds (1970, 1979); Frank (1981); Iri and Tomizawa (1976); Lawler (1976) for the 
weighted matroi d intersection problem, defined as follows. Given two matroids M\ = (E,X\) andM2 = {E,!^), 
and a weight function w : E — > R, find an I C E that is independent in both matroids and maximizes the total 
weight of /, that is, w(I) = J2eei w i e )- The complexity of the fastest algorithm we are aware of (for the general 
problem) is summarized as follows. 
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Theorem 1 (Brezovec, Cornuejols, and Glover, 1986). The weighted matroid intersection problem can be solved 
in 0(mr(r + c + log m)) time, where m — \E\, r is the minimum of the ranks of M\ and M 2 , and c is the time 
needed for finding the circuit of I U {e} in both M\ and M 2 where e G E and I is independent in both Mi and 
M 2 . 

We now proceed to the specification of two matroids, Mi (S) = (N x N,li(S)) and M 2 (N x N,1 2 (S)), 
parametrized by an arbitrary arc set S such that S is acyclic. 

The in-degree matroid M\ (S): Let I\ (S) consist of all arc sets B such that no arc in B has a head in H (S) and 
every node outside H(S) is the head of at most one arc in B. 

The acyclicity matroid M 2 (S): Let 12(5') consist of all arc sets B such that B U 5* is acyclic. 

We observe that the standard matroid intersection formulation of branchings is obtained as the special case of 
S = 0: then an arc set is seen to be branching if and only if it is independent in both the in-degree matroid and 
the acyclicity matroid. 

The next two lemmas show that Mi(S) and M 2 (S) are indeed matroids whenever S is acyclic. 
Lemma 2. Mi (S) is a matroid. 

Proof. Fix the arc set S and denote li (5) by T\ for short. Clearly, <G Ii and if A C B and B G 1\ then also 
A G Ii. Consequently, M X {S) satisfies (Ml) and (M2). To see that Mi (5") satisfies (M3) let A, B e Ii with 
| A | < \B\. Because of the definition of Mi(S') the sets A and B contain at most one arc with head v, for every 
v E N \ H(S). Because \ A\ < \B\ there is a node v G N \ H(S) such that v is the head of an arc in B but v is 
not the head of an arc in A. Let e G B be the arc with head v. Then e G B \ A and A U {e} G I\. Hence, Mi (S) 
satisfies (M3). □ 

Lemma 3. M 2 (S) is a matroid. 

Proof. Fix the arc set S and denote I 2 (S) by X 2 for short. Because the skeleton S is acyclic and acylicity is a 
hereditary property (a graph property is called hereditary if it is closed under taking induced subgraphs) it follows 
that G 1 2 and if A C B and B E 1 2 then also A G 1 2 . Consequently, M 2 {S) sa tisfies ( Ml) and ( M2). To see 
that M 2 (S) satisfies (M3) let A, B G 1 2 with \ A\ < \B\. Consider the sets A' = A U S and B' = BUS. Let 
C be a connected subset of A'. Because both A' and B' are acyclic, it follows that the number of edges of B' 
with both endpoints in N(C) is at most the number of edges of A' with both endpoints in N(C). Because every 
edge in A' \ S corresponds to an arc in A and similarly every edge in B 1 \ S corresponds to an arc in B and 
\A\ < \B\,it follows that there is an arc e G B \ A whose endpoints are contained in two distinct components of 
A'. Consequently, the set A' U {e} is acyclic and hence AL) {e} <E X 2 . □ 

We now relate the common independent sets of these two matroids to fc-branchings. If A is a fc-branching, we 
call an arc set D a deletion set of A if D is a subset of A, contains at most k arcs, and in A \ D every node has at 
most one parent. 

Lemma 4. Let A be an arc set and D a subset of A of size at most k such that no two arcs from D' — { (u, v) G 
A \ D : v G H(D) } have the same head and such that S is acyclic, where S — D U D' . We have that A is a 
k-branching with deletion set D if and only if A\S is independent in both Mi (S) and M 2 (S). 

Proof. (=>) : Suppose A is a fc-branching with deletion set D. Then A \ D is a branching, which shows that 
every node v outside H (S) has in-degree at most one in A \ S. Since by definition all arcs with a head in H (S) 
are contained in S, no arc in A \ S has a head in H(S). Therefore, A \ S is independent in Mi(S'). Since every 
fc-branching is a polytree, (A \ S) U S — A is acyclic, and th erefore A \ S is independent in M 2 (S). 

(<=) : Since A \ S is independent in M 2 (S), we have that (A \ S) U S = A is acyclic. Thus, A is a polytree. 
As A \ S is independent in Mi (S), every node outside H (S) has in-degree at most one in A \ S and every node 
from H(S) has in-degree zero in A \ S. Since the head of every arc from D' is in ^(5*) and no two arcs from D' 
have a common head, (A \ S) U D 1 = A\D has maximum in-degree at most one. Because \D\ < fc, we have 
that A is a fc-branching with deletion set D. □ 
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Figure 2: Left: the two guessed arc sets D (dotted) and D' (dashed). Right: the arc set A (solid) that is a heaviest 
common independent set of the two matroids M\(S) and M2(S). 

The characterization of Lemma 4 enables the following algorithm for the fc-branching problem. Define the 
weight function by letting w(u,v) — f v ({u}) — f v ($) for all arcs (u,v). Guess the arc sets D and D', put 
S = D U D', check that S is acyclic, find a maximum-weight set B that is independent in both M\(S) and 
M 2 (5); output a fc-branching A = BU S that yields the maximum weight over all guesses D and D', where the 
weight of B U S is obtained as 

w(B)+ ]T (/„(£,)- /„(0)) . 

u€if(S) 

It is easy to verify that maximizing this weight is equivalent to maximizing the score f(A). Figure 2 illustrates 
the algorithm for the scoring function of Figure 1 . 

It remains to analyze the complexity of the algorithm. Denote by n the number of nodes. For a moment, 
consider the arc set S fixed. To apply Theorem 1, we bound the associated key quantities: the size of the ground 
set is 0(n 2 ); the rank of both matroids is clearly 0(n); circuit detection can be performed in 0(n) time, by a 
depth-first search for M\ (S) and by finding a node that has higher in-degree than it is allowed to have in M 2 {S). 
Thus, by Theorem 1, a maximum-weight set that is independent in both matroids can be found in 0(n A ) time. 
Then consider the number of possible choices for the set S = DUD' . There are 0(n 2k ) possibilities for choosing 
a set D of at most fc arcs such that D is acyclic. For a fixed D, there are 0(n k ) possibilities for choosing a subset 
D' C N x H(D) such that DUD' is acyclic and no two arcs from D' have the same head. Thus there are 0(n 3k ) 
relevant choices for the set S. 

We have shown the following. 

Theorem 5. The k-branching problem can be solved in 0(n 3fc+4 ) time. 

4 Fixed-parameter tractability 

Theorem 5 shows that the fc-branching problem can be solved in "non-uniform polynomial time" as the order of 
the polynomial time bound depends on fc. In this section we study the question of whether one can get fc "out of 
the exponent" and obtain a uniform polynomial-time algorithm. 

The framework of Parameterized Complexity Downey and Fellows (1999) offers the suitable tools and meth- 
ods for such an investigation, as it allows us to distinguish between uniform and non-uniform polynomial-time 
tractability with respect to a parameter. An instance of a parameterized problem is a pair (I, fc) where I is the 
main part and fc is the parameter; the latter is usually a non-negative integer. A parameterized problem is fixed- 
parameter tractable if there exist a computable function / and a constant c such that instances (7, fc) of size n can 
be solved in time 0{f(k)n c ). FPT is the class of all fixed-parameter tractable decision problems. Fixed-parameter 
tractable problems are also called uniform polynomial-time tractable because if fc is considered constant, then in- 
stances with parameter fc can be solved in polynomial time where the order of the polynomial is independent of 
fc (in contrast to non-uniform polynomial-time running times such as n k ). 

Parameterized complexity offers a completeness theory similar to the theory of NP-completeness. One uses 
parameterized reductions which are many-one reductions where the parameter for one problem maps into the 
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parameter for the other. More specifically, problem L reduces to problem L' if there is a mapping R from instances 
of L to instances of V such that (i) (7, fc) is a yes-instance of L if and only if (/', fc') = R(I, k) is a yes-instance 
of L', (ii) k' < g(k) for a computable function g, and (iii) R can be computed in time 0(f(k)n c ) where / 
is a computable function, c is a constant, and n denotes the size of (I, k). The parameterized complexity class 
W[l] is considered as the parameterized analog to NP. For example, the parameterized Maximum Clique problem 
(given a graph G and a parameter k > 0, does G contain a complete subgraph on k vertices?) is W[ Incomplete 
under parameterized reductions. Note that there exists a trivial non-uniform polynomial-time n k algorithm for 
the Maximum Clique problems that checks all sets of k vertices. FPT ^ W[l] is a widely accepted complexity 
theoretic assumption Downey and Fellows (1999). For example, FPT = W[l] implies the (unlikely) existence 
of a 2°( n ) algorithm for n-variable 3SAT Impagliazzo, Paturi, and Zane (2001); Flum and Grohe (2006). A first 
parameterized analysis of probabilistic network structure learning using structural parameters such as treewidth 
has recently been carried out by Ordyniak and Szeider (2010). 

The algorithm from Theorem 5 considers 0(n 3k ) relevant choices for the set S = D U D', and for each fixed 
choice of S the running time is polynomial. Thus, for restrictions of the problem for which the enumeration of 
all relevant sets S is fixed parameter tractable, one obtains an FPT algorithm. One such restriction requires that 
S = D U D' is an in-tree, i.e., a directed tree where every arc is directed towards a designated root, and each node 
has a bounded number of potential parent sets. 

Theorem 6. The k-branching problem is fixed-parameter tractable if we require that (i) the set S = D U D' of 
arcs is an in-tree and ( ii) each node has a bounded number of potential parent sets. 

Proof. To compute a fc-branching A, the algorithm guesses its deletion set D and the set D' = {(u, v) € A \ D : 
v e H(D)}. As A is a fc-branching, \D\ < k and for every v e H(D) there is at most one arc in D' with head 
v. The algorithm first guesses the root r for the in-tree S. Then it goes over all possible choices for D and D 1 as 
follows, until D has at least k arcs. 

Guess a leaf I of S (initially, r is the unique leaf of S), and guess a non-empty parent set P for I in A. If 
\D\ + \P\ + 1 > k, then backtrack. Otherwise, choose at most one arc (p, £) to add to D', where p G P, and add 
all other arcs from a node from P to I to D (if \P\ = 1, no arc is added to D'). Now, check whether the current 
choice for £ = D U D' leads to a fc-branching by checking whether S is acyclic and using the matroids Mj (S) 
and M2(S) as in Theorem 5. 

There are at most n choices for r. The in-tree S is expanded in at most fc steps, as each step adds at least 
one arc to D. In each step, I is chosen among at most fc + 1 leaves, there is a constant number of choices for its 
parent set P and at most fc + 2 choices for adding (or not) an arc (p, £), with p 6 P, to D 1 (as \P\ < k + 1). The 
acyclicity check for S and the weighted matroid intersection can be computed in time 0(n 4 ), leading to a total 
running time of 0(k 2k c k n 5 ), where c is such that every node has at most c potential parent sets. □ 

Condition (i) in Theorem 6 may be replaced by other conditions requiring the connectivity of D or a small 
distance between arcs from D, giving other fixed-parameter tractable restrictions of the fc-branching problem. 

The following theorem shows that an exponential dependency on fc or some other parameter is necessary since 
the fc-branching problem remains NP-hard under the restrictions given above. 

Theorem 7. The k-branching problem is NP-hard even if we require that (i) the set S — D Li D' of arcs is an 
in-tree and ( ii) each node has at most 3 potential parent sets. 

Proof. We devise a polynomial reduction from 3-SAT-2 a version of 3-SATISFIABILITY where every literal 
occurs at most in two clauses. 3-S AT- 2 is well known to be NP-hard Garey and Johnson (1979). Our reduction 
uses the same ideas as the proof of Theorem 6 in Dasgupta (1999). Let $ be an instance of 3-S AT- 2 with clauses 
C\, . . . , C m and variables x\, . . . , x n . We define the set N of nodes as follows. For every variable Xi in $ the 
set contains the nodes p i ,Xi 1 x\ 1 x\, x\ and xf . Furthermore, for every clause Cj the set N contains the nodes 
p n +j and Cj. Let 1 < i < n, 1 < j < m, and 1 < I < 2. We set f{Cj,x[) = 1 if the clause Cj is the Z-th 
clause that contains the literal xi. Similarly, we set f(Cj, x\) = 1 if the clause Cj is the Z-th clause that contains 
the literal^. We set /(x i; {xj, xf}) = f(x l ,{x\,x 2 i }) = 1, f{pi,{xi}) = f(pi,{xi,pi-i}) = 1 for every 
1 < i < n, and f(p n+ j, {Cj,p n+ j-i}) = 1 for every 1 < j < m. Furthermore, we set f(v, P) = for all the 
remaining combinations of v G N and P C N . This completes the construction of N and /. Observe that every 
node of N has at most 3 potential parent sets. This completes our construction. We will have shown the theorem 
after showing the following claim. 
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Figure 3: An optimal 2n + m-branching D for the formula $ = d A C 2 A C 3 with Ci = xi V x 2 V x 3 , 
C 2 = V x 2 V xs, and C3 = x\ V x 2 V x 3 according to the construction given in the proof of Theorem 7. 

Claim: $ is satisfiable if and only if there is a 2n + m-branching D such that f(D) > 2(m + n), the set 
S = D U D' of arcs is an in-tree, and each node of N has at most 3 potential parent sets. 

{=>) : Suppose that the formula <f> is satisfiable and let j3 be a satisfying assignment for <f>. Furthermore, for 
every 1 < j < m let lj be a literal of Cj that is set to true by (3. We construct a 2n + m-branching D as follows. 
For every 1 < j < m the digraph D contains an arc (x\, Cj) if lj — Xi and Cj is the Z-th clause that contains 
Xi and an arc (x\, Cj) if lj = Xi and Cj is the Z-th clause that contains Xj for some 1 < i < n and 1 < I < 2. 
Furthermore, for every 1 < i < n the digraph D contains the arcs (xj,Xi) and (x?,Xi) if /?(xj) — false and the 
arcs (xj,Xi) and (xf,Xi) if /3(xj) = true. Last but not least D contains the arcs (xi,Pi), (Cj,p n+ j) and (pi,pi +1 ) 
for every 1 < i < n, 1 < j < m, and 1 < I < m + n. Figure 3 shows an optimal 2n + m-branching D for 
some 3-SAT-2 formula. It is easy to see that D is a 2n + m-branching such that ,f{D) = 2(m + n) and the set 
S = D U D' of arcs is an in-tree. 

(<=) : Suppose there is a 2n + m-branching D such that /(D) > 2(m + n). Because f(D) > 2(m + n) 
it follows that every node of N achieves its maximum score in D. Hence, D has to contain the arcs (x i} pi), 
(Cj,p n+ j), (pi,pi + \), for every 1 < i < n, 1 < j < m, and 1 < I < m + n. For the same reasons D has to 
contain either the arcs (xj 7 x,) and (xf, Xi) or the arcs (x], Xi) and (xf,Xi) for every 1 < i < n. Furthermore, 
for every 1 < j < m the 2n + m-branching D has to contain one arc of the form (x\, Cj) or (x\, Cj) where 
Cj is the l-th clause that contains Xi or x i7 respectively, for some 1 < i < n and 1 < I < 2. Let 1 < i < n, 
1 < j < m, and 1 < I < 2. We first show that whenever D contains an arc (x\ , Xi) then D contains no arc of the 
form (x\, Cj) and similarly if D contains an arc (x\, Xi) then D contains no arc of the form (xi, Cj). Suppose 
for a contradiction that D contains an arc (x • , Xi ) together with an arc {x\ , Cj ) or an arc (x l il x i ) together with an 
arc {x\ , Cj ) . In the first case D contains the undirected cycle (x\ , x% , Pi , . . . , p n +j , Cj , x\ ) and in the second case 
D contains the cycle (x\,Xi,pi, . . . ,p n +j,Cj,x[) contradicting our assumption that D is a 2n + m-branching. 
It now follows that the assignment (3 with f3(xi) = true if D does not contain the arcs (xj , Xi) and (xf , Xi) and 
(3{xi) — false if D does not contain the arcs (x},Xi) and (xf , Xi) is a satisfying assignment for □ 

So far we have measured the difference of a polytree to branchings in terms of the number of arcs to be 
deleted. Next we investigate the consequences of measuring the difference by the number of nodes to be deleted. 
We call a polytree A a k-node branching if there exists a set of at most k nodes X C A such that A \ X is a 
branching. The k-node branching problem is to find a fc-node branching A that maximizes f(A). Clearly every 
fc-branching is a fc-node branching, but the reverse does not hold. In other words, the fc-node branching problem 
generalizes the fc-branching problem. 

In the following we show that the fc-node branching problem is hard for the parameterized complexity class 
W[l]; this provides strong evidence that the problem is not fixed-parameter tractable. 
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Figure 4: An example graph G (k — 3) together with an optimal fc'-node branching D with f{D) > s according 
to the construction given in the proof of Theorem 8. 

Theorem 8. The k-node branching problem is W[l]-hard. 

Proof. We devise a parameterized reduction from the following problem, called Partitioned Clique, which is 
well-known to be W[l]-complete for parameter k Pietrzak (2003). The Instance is a fc -partite graph G = (V, E) 
with partition Vi, . . . , Vfc such that \V~i\ = n for every 1 < i < k. The question is whether there are nodes 
vi, . . . ,Vk such that u, 6 for 1 < i < k and {i)i,Vj} G E for 1 < i < j < fc? (The graph K = 
({v\, . . . , Vk}, { {vi, vj} : 1 < i < j < k }) is a k-clique of G.) 

Let G = (V,E) be an instance of this problem with partition Vi,... , Vfc, |Vi| = ••• = |Vfe| = n, and 
parameter k. Let fc' = ( 2 ) + k, a = 1, and s = k'a. Let A = { dij : 1 < i < j < k } and Ai = { aik G A : I = i 
or k = i } for every 1 < i < k. Then N is defined as N = A U {ci, . . . , c^} U { v 1 , . . . , v k : v G V }. Let 
V™ = { v 1 , . . . , v k : v G Vi and w ^ w }. We define the score function / as follows. We set /(cj, A, U V™) = a 
for every 1 < i < k and w G Vi, and /(ay, {« J ,w'}) = a for every 1 < « < j < k, u G Vi, w G V,, and 
{u, w} G E(G). Furthermore, we set f(v, P) = for all the remaining combinations of v and P. This completes 
our construction. We will have the theorem after showing the following claim. 

Claim: G has a k-clique if and only if there is a k' -node branching D such that f(D) > s. 

(=>) : Suppose that G has a fc-clique K. Then it is easy to see that the DAG D on N defined by the arc set 
{ {v j ,aij), {v j ,aji) : v G V(K) n V and 1 < i, j < fc} U { (v\aij), {v^aji) : v G V{K) n Vj and 1 < i,j < 
k } U { (ay, Ci) :l<i<j<fc}U{ (a y , c 3 ) : 1 < i < j < k } U { {v j ,Ci) : v G V \ {{J e eE(K) e ) and 1 ^ 
i, j < fc } is a fc'-node branching and f(D) = s. Figure 4 shows an optimal fc'-node branching D constructed 
from an example graph G. 

(<=) : Suppose there is a fc'-node branching D with f(D) > s. It follows that every node of D achieves its 
maximum score. In particular, for every 1 < i < fc the nodes Cj must have score a in D and hence there is a node 
Wi G Vi such that Cj is adjacent to all nodes in V™* U A t . Furthermore, for every 1 < i < j < fc the node a^ 
is adjacent to exactly one node in Vj and to exactly one node in Vj. Let t>' be the unique node in Vj adjacent to 
a,j and similarly let uj" be the unique node in Vj that is adjacent to ajj for every 1 < z < j < fc. Then Wj = «j 
and = Vj because otherwise the skeleton of D would contain the cycle (vi, atj, Cj) or the cycle (vj,dij,Cj). 
Consequently, the edges represented by the parents of aij in D for all 1 < i < j < k form a fc-clique in G. □ 

5 Concluding remarks 

We have studied a natural approach to extend the known efficient algorithms for branchings to polytrees that 
differ from branchings in only a few extra arcs. At first glance, one might expect this to be achievable by simply 
guessing the extra arcs and solving the remaining problem for branchings. However, we do not know whether 
such a reduction is possible in the strict sense. Indeed, we had to take a slight detour and modify the two matroids 
in a way that guarantees a control for the interactions caused by the presence of high-in-degree nodes. As a result, 
we got an algorithm that runs in time polynomial in the input size: namely, there can be more than relevant 
input values for each of the n nodes; so, the runtime of our algorithm is less than cubic in the size of the input, 
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supposing the local scores are given explicitly. While this answers one question in the affirmative, it also raises 
several further questions, some of which we give in the next paragraphs. 

Our complexity analysis relied on a result concerning the general weighted matroid intersection problem. 
Do significantly faster algorithms exist when restricted to our two specific matroids? One might expect such 
algorithms exist, since the related problem for branchings can be solved in 0(n 2 ) time by the algorithm of Tarjan 
(1977). 

Even if we could solve the matroid intersection problem faster, our algorithm would remain practical only 
for very small values of fc. Can one find an optimal fc-branching significantly faster, especially if allowing every 
node to have at most two parents? As the current algorithm makes around n 3k mutually overlapping guesses, 
there might be a way to considerably reduce the time complexity. Specifically, we ask whether the restricted 
problem is fixed-parameter tractable with respect to the parameter fc, that is, solvable in 0(f(k)p(n)) time for 
some computable function / and polynomial p Downey and Fellows (1999). The fixed-parameter algorithm given 
in Section 4 can be seen as a first step towards an answer to this question. Can we find other restrictions under 
which the fc-branching problem becomes fixed-parameter tractable? 

Can we use a similar approach for the more general fc-node branching problem, i.e., is there a polynomial time 
algorithm for the fc-node branching problem for every fixed fc? Likewise, we do not know whether the problem 
is easier or harder for polytrees than for general DAGs: Do similar techniques apply to finding maximum-score 
DAGs that can be turned into branchings by deleting some fc arcs? 
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